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1 Introduction 


The situation calculus (McCarthy [17]) has long been a staple of AI research, 
but only recently have there been attempts to carefully axiomatize it. The 
variant described here has evolved in response to the needs of the Cognitive 
Robotics Project at the University of Toronto, and its foundations are now 
sufficiently developed to be gathered in one place. That is what this paper 
sets out to do. 

The principal intuition captured by our axioms is that situations are 
histories — finite sequences of primitive actions — and we provide a binary 
constructor do(a, s) denoting the action sequence obtained from the history 
s by adding action a to it. Other intuitions are certainly possible about the 
nature of situations. McCarthy and Hayes [19] saw them as ’snapshots” of a 
world. For the purposes of representing narratives, McCarthy and Costello 
[18] wish to express that any number of actions may occur between a situa- 
tion and its ”successor”, and therefore they do not appeal to a constructor 
like do. 

The axioms and supporting metatheory presented in this paper have 
provided foundations for a wide range of practical work in modeling dynami- 
cal systems, including planning (Green [6], Levesque [14]), control (Levesque 
et al [15], De Giacomo et al [4]), simulation (Kelley [10]), database updates 
(Reiter [27], Bertossi et al [2]), diagnosis of dynamical systems (Mcllraith 
[20]), and agent programming and robotics (Jenkin et al [9], Lespérance et 


al [13], Burgard et al [3], Shapiro et al [81], Reiter [29]). 


2 The Language of the Situation Calculus 


The following description is taken from Pirri and Reiter [24]. Leiteaic is a 
second order language with equality. It has three disjoint sorts: action for 
actions, situation for situations, and a catch-all sort object for everything else 
depending on the domain of application. Apart from the standard alphabet 
of logical symbols — we use A, 7 and J, with the usual definitions of a full 
set of connectives and quantifiers — Le jteaie has the following alphabet: 


e Countably infinitely many individual variable symbols of each sort. 
We shall use s and a, with subscripts and superscripts, for variables 
of sort situation and action, respectively. We normally use lower case 
roman letters other than a,s, with subscripts and superscripts for 
variables of sort object. In addition, because Lej¢eqte 18 second order, 
its alphabet includes countably infinitely many predicate variables of 
all arities. 


e Two function symbols of sort situation: 


1. A constant symbol Sy, denoting the initial situation. 


2. A binary function symbol do : action x situation — situation. 
The intended interpretation is that do(a, s) denotes the succes- 
sor situation resulting from performing action a in situation s. 


e A binary predicate symbol C: situation x situation, defining an order- 
ing relation on situations. The intended interpretation of situations 
s’ means that s is a proper 


is as action histories, in which case s C 
subhistory of s’. 
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A binary predicate symbol Poss : action x situation. The intended 
interpretation of Poss(a,s) is that it is possible to perform the action 
a in situation s. 


For each n > 0, countably infinitely many predicate symbols with ar- 
ity n, and sorts (actionU object)”. These are used to denote situation 
independent relations like 

human( John), primeNumber(n), 

moving Action(run(person, locl, loc2)), etc. 


e For each n > 0, countably infinitely many function symbols of sort 
(action U object)” — object. These are used to denote situation inde- 
pendent functions like 


sqrt(a), height(Mt Everest), agent(run(person, locl, loc2)), etc. 


For each n > 0, a finite or countably infinite number of function 
symbols of sort (action U object)” — action. These are called action 
functions, and are used to denote actions like pickup(x), move(A, B), 
etc. In most applications, there will be just finitely many action 
functions, but we allow the possibility of an infinite number of them. 


Notice that we distinguish between function symbols taking values 
of sort object and those — the action functions — taking values of 
sort action. In what follows, the latter will be distinguished by the 
requirement that they be axiomatized in a particular way by what we 
shall call action precondition axioms. 


e For each n > 0, a finite or countably infinite number of predicate 
symbols with arity n +1, and sorts (action U object)” x situation. 
These predicate symbols are called relational fluents. In most ap- 
plications, there will be just finitely many relational fluents, but we 
do not preclude the possibility of an infinite number of them. These 
are used to denote situation dependent relations like ontable(x, s), 
husband(Mary, John, s), etc. Notice that relational fluents take just 
one argument of sort situation, and this is always its last argument. 


e For each n > 0, a finite or countably infinite number of function 
symbols of sort (actionUobject)” x situation — actionUobject. These 
function symbols are called functional fluents. In most applications, 
there will be just finitely many functional fluents, but we do not 
preclude the possibility of an infinite number of them. These are 
used to denote situation dependent functions like 


age(Mary, s), prime Minister(Italy, s), etc. 


Notice that functional fluents take just one argument of sort situation, 
and this is always its last argument. 


Notice that only two function symbols of Lejteaie — So and do — are permitted 
to take values in sort sttuatzon. 


3 Foundational Axioms for Situations 


Here, we focus on the domain of situations. The primary intuition about 
situations that we wish to capture axiomatically is that they are finite se- 
quences of actions. We want also to be able to say that a certain sequence of 
actions is a subsequence of another. To formalize these desiderata, we adopt 
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the following four foundational axioms for the situation calculus, taken from 


[24]. 


do(a1, $1) = do(ag, 82) D a, = ag A581 = $2, (1) 
(VP).P(So) A (Va, 8)[P(s) D P(do(a, s))] D (Vs)P(s). (2) 


Axiom (1) is a unique names axiom for situations; two situations are 
the same iff they are the same sequence of actions. Two situations 5, and 
So may be different, yet assign the same truth values to all fluents. So a 
situation in the version of the situation calculus presented here must not be 
identified with the set of fluents that hold in that situation, i.e with a state. 
The proper way to understand a situation is as a history, namely, a sequence 
of actions; two situations are equal iff they denote identical histories. The 
second axiom (2) is second order induction on situations. The importance 
of induction for the situation calculus is described by Reiter [26]. 

There are two more axioms: 


as80 So, (3) 
sCdo(a,s')=sCs’. (4) 


Here s C s’ is an abbreviation for s C s’ Vs = s’. The relation C provides 
an ordering relation on situations. Intuitively, s C s’ means that the action 
sequence s’ can be obtained from the sequence s by adding one or more 
actions to the front of s.? 

The above four axioms are domain independent. They provide the basic 
properties of situations in any domain specific axiomatization of particular 
fluents and actions. Henceforth, we shall call them ©.? 


3.1 Time, Processes and Concurrency in the Situation 
Calculus 


As presented above, the foundational axioms for situations provide a purely 
qualitative notion of time whose only temporal concept is sequential action 
occurrence: An action occurs before or after another within a situation. 
There is no way of expressing that an action occurs at a particular time, or 
that two or more actions occur concurrently. While we do not present them 
here, it is possible to extend these axioms for situations to accommodate 
time and concurrency. See, for example, Pinto [23] and Reiter [28], and 
for a discussion of how to model processes in the situation calculus using 
interleaving concurrency, see Reiter [29]. 


‘In what follows, lower case Roman characters will denote variables in formulas. 
Moreover, free variables will always be implicitly universally prenex quantified. 

? Readers familiar with the programming language LISP will have noticed that 
in the situation calculus, the constant So is just like NIL, and do acts like cons. 
Situations are simply lists of primitive actions. For example, the situation term 
do(C, do(B,do(A, So))) is simply an alternative syntax for the LISP list (C B A) 
(= cons(C,cons(B,cons(A,ntl)))). Notice that to obtain the action history cor- 
responding to this term, namely the performance of action A, followed by B, 
followed by C, we read this list from right to left. Therefore, when one reads 
situation terms from right to left, the relation s C s’ means that situation s is a 
proper subhistory of the situation s’. The situation calculus induction axiom (2) 
is simply the induction principle for lists: If the empty list has property P and if, 
whenever list s has property P so does cons(a,s), then all lists have property P. 

°These foundational axioms are simpler than those presented by Reiter [27] and 
others. These earlier axiomatizations for the situation calculus are all derivable 
from the four axioms given here. 
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4 Domain Axioms and Basic Theories 
of Actions 


Our concern here is with axiomatizations for actions and their effects that 
have a particular syntactic form. These are called basic action theories, and 
we next describe these. 


Definition 4.1 The Uniform Formulas 
Let o be a term of sort situation. The terms of Leiteaie uniform in o are 
the smallest set of terms such that: 


1. Any term that does not mention a term of sort situation is uniform 
ino. 

2. 0 is uniform in o. 

3. If g is an n-ary function symbol other than do and So, and t),...,t, 


are terms uniform in o whose sorts are appropriate for g, then 
g(ti,...,¢,) is a term uniform in o. 


The formulas of Lejteatc uniform in o are the smallest set of formulas such 
that: 


1. If t; and t. are terms of the same sort object or action, and if they 
are both uniform in o, then t, = fg is a formula uniform in o. 


2. When P is an n-ary predicate symbol of Lejteate, other than Poss and 


C, and t,,...,t, are terms uniform in o whose sorts are appropriate 
for P, then P(t1,...,t,) is a formula uniform in o. 


3. Whenever U,,U2 are formulas uniform in a, so are 3U;,, U, A U2 
and (dv)U, provided v is an individual variable, and it is not of sort 
situation. 


Thus, a formula of £sjteagte 18 uniform in o iff it is first order, it does 
not mention the predicates Poss or C, it does not quantify over variables 
of sort situation, it does not mention equality on situations, and whenever 
it mentions a term of sort sztuatzon in the situation argument position of a 


fluent, then that term is o. 


Definition 4.2 Action Precondition Axiom 
An action precondition axiom of Lejteaic 18 a sentence of the form: 


Poss(A(@1,...,€n), 8) = Ma(a,..-,@n, 8), 


where A is an n-ary action function symbol, and II4(#1,---,a@p,8) is a 
formula that is uniform in s and whose free variables are among 21,..., pn, 8. 


For example, in a blocks world, we might typically have: 


Poss(pickup(«), 5) = (Vy)7holding(y, 8) A aheavy(a, 8). 


The uniformity requirement on II, ensures that the preconditions for 
the executability of the action A(a,,...,#%,) are determined only by the 
current situation s, not by any other situation. 
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Definition 4.3 Successor State Axiom 


1. A successor state axiom for an (n + 1)-ary relational fluent F is a 
sentence of Lyjteate of the form: 


F(a,...,%n,do(a, s)) = Op(a1,...,%n,4, 8), (5) 
where ®p(x1,...,%n,4,8) is a formula uniform in s, all of whose free 
variables are among 4, 8,21,...,%,. An example of such an axiom, 


taken from [25], is: 
broken(x, do(a, s)) = 
(Ar){a = drop(r, x) A fragile(x, s)} V (4b){a = explode(b) A 
nexto(b, x, s)} V 
broken(a, s) An(dr)a = repair(r, x). 


This says that x will be broken in the successor situation do(a, s) iff x 
was fragile in s and the action taking us to the successor situation was 
someone (7) dropping x, or the action was some bomb 6 exploding, 
and 6 was next to x, or x was already broken, and the action was not 
someone repairing 2. 


As for action precondition axioms, the uniformity of Br guarantees 
that the truth value of F(#1,...,%n,do(a,s)) in the successor situa- 
tion do(a, s) is determined entirely by the current situation s, and not 
by any other situation. In systems and control theory, this is often 
called the Markov property. 


2. A successor state axiom for an (n + 1)-ary functional fluent f is a 
sentence of Lyjteate of the form: 


f(@1,...,®n, do(a, s)) =Sy= (@1,..-,2n,Y, 4, $), 


where $;(#1,...,2n,¥,4,8) is a formula uniform in s, all of whose 
free variables are among £1,...,%,y,a,s. A blocks world example 
1s: 

height(, do(a,s)) = y = a= moveToTable(z) Ny=1 V 
(dz, h)(a = move(a,z) A height(z,s) =hAy=h+1) Vv 
height(«,s) = yAa £ moveT oT able(x)A7(dz)a = move(x, z). 


As for relational fluents, the uniformity of ¢; in the successor state 
axioms for functional fluents guarantees the Markov property: The 
value of a functional fluent in a successor situation is determined en- 
tirely by properties of the current situation, and not by any other 
situation. 


Following earlier ideas of Pednault [22], Haas [7] and Schubert [30], Reiter 
[25] shows how to solve the frame problem for deterministic actions.4 The 
resulting solution yields axioms with exactly the syntactic form of successor 
state axioms. 


Basic Action Theories Henceforth, we shall consider theories D of 
Lsiteate of the following forms: 

D = SUD; UD ap UPunaU Ds, 
where, 


“This solution does not take ramification constraints into account, but see 
[16, 20, 23] for possible ways to do this, while preserving the successor state 
axiom approach. 
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¥ are the foundational axioms for situations. 


e D,, is a set of successor state axioms for functional and relational 


fluents, one for each such fluent of the language Loiteate. 


symbol of Leiteate- 


of L sitcale: 


sentence of Ds, quantifies over situations, or mentions Poss, 


Dap 1s a set of action precondition axioms, one for each action function 
Puna 18 the set of unique names axioms for all action function symbols 


Ds, is a set of first order sentences that are uniform in So. Thus, no 


C or the 


function symbol do, so that So is the only term of sort situation men- 
tioned by these sentences. Ds, will function as the initial theory of 
the world (i.e. the one we start off with, before any actions have been 
“executed” ). Often, we shall call Ds, the initial database. The initial 
database may (and often will) contain sentences mentioning no situ- 
ation term at all, for example, unique names axioms for individuals, 
like John # Mary, or “timeless” facts like is Mountain( MtEverest), 


or dog(x) D mammal(«). 


Definition 4.4 A basic action theory is any collection of axioms PD of the 
above form that also satisfies the following functional fluent consistency 


property: 


Whenever f is a functional fluent whose successor state axiom in D,, is 


L(, do(a, s)) =y= os (%, ¥, 4,8), 
then 
Puna UDs, (Va, s).(Vi).(Jy)oy (2, y,a,8) A 


This consistency property provides a sufficient condition for preventing 


a source of inconsistency in f’s successor state axiom. It says that 


the con- 


ditions defining f’s value in the next situation do(a,s), namely ¢;, actually 


define a value for f, and that this value is unique. 


5 Metatheory for the Situation Calculus 


Here we present some fundamental logical properties of basic action theories. 


These are described more fully, with accompanying proofs, in [24] 


5.1 Relative Satisfiability 


Basic action theories enjoy an important relative satisfiability property: 


Theorem 1 A basic action theory D is satisfiable off Dung UDs, 18. 


5.2 -Reduction 


For the purposes of automating deduction in the situation calculus, the 
foundational axioms © are problematic, especially the induction axiom. Ac- 
cordingly, it would be desirable to characterize broad classes of sentences 
whose proofs need never appeal to induction, or even better, can addition- 


ally ignore some of the other axioms of %, or best of all, need never appeal 
to any of the foundational axioms ©. The main result along these lines is 


the following: 
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Theorem 2 (X-Reduction Theorem) 

Suppose that D = NUDss UDap U Puna UDs, ts a basic action theory. 
Suppose further that w is a first order sentence of Lsitcaic whose prener 
normal form has the form 


Q1(As1)Q2(As2)--- (Asn )Qne, n> 0, 
where 81,...,8, are variables of sort situation, ¢ is quantifier free, and 
each Q; ts a sequence of zero or more quantifiers over variables of sort 
action U object. Then, 


1LDEwW iff D— {Induction} 5 uv. 


Here, D—{Induction} is D without the second order induction axiom. 


2. If does not mention the predicate symbol C, 
DEwv aff S2UDssU Dap UPuna UP 5, FY. 


Here, = consists of the foundational axiom (1) together with the 
following sentence: 


So # do(a, s). 


3. Ifw does not mention the predicate symbol C and it does not mention 
an equality atom over terms of sort situation, 


DEwv of Dss Dap UPuna UPS, Fv. 


5.3 Regression 


Regression is perhaps the single most important theorem-proving mecha- 
nism for the situation calculus; it provides a sytematic way to establish 
that a basic action theory entails a so-called regressable sentence. 


Definition 5.1 The Regressable Sentences. A regressable sentence of 
the situation calculus is a first order sentence W of Leiteate such that W 
does not quantify over situations, and such that for every atom of the form 
Poss(a,o) mentioned by W, a@ has the form A(t1,...,t,) for some n-ary 
action function symbol A of Lsjteaie- 


The essence of a regressable sentence is that each of its situation terms is 
rooted at Spo, and therefore, one can tell, by inspection of such a term, 
exactly how many actions it involves. It is not necessary to be able to 
tell what those actions are, just how many there are. In addition, when a 
regressable sentence mentions a Poss atom, we can tell, by inspection of 
that atom, exactly what is the action function symbol occurring in its first 
argument position, for example, that it is a move action. 

The intuition underlying regression is this: Suppose we want to prove 
that a regressable sentence W is entailed by some basic action theory. Sup- 
pose further that W mentions a relational fluent atom F(f, do(a,o)), where 
F’s successor state axiom is f'(@, do(a, s)) = Op(#,a,s). Then we can eas- 
ily determine a logically equivalent sentence W’ by substituting @ p(t, a,c) 
for F(t, do(a,o)) in W. After we do so, the fluent atom F(t, do(a,o)), in- 
volving the complex situation term do(a,o), has been eliminated from W 
in favour of Op (f, a,o), and this involves the simpler situation term o. In 
this sense, W’ is “closer” to the initial situation So than was W. Moreover, 
this operation can be repeated until the resulting goal formula mentions 
only the situation term So, after which, intuitively, it should be sufficient to 
establish this resulting goal using only the sentences of the initial database. 
Regression is a mechanism that repeatedly performs the above reduction 
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starting with a goal W, ultimately obtaining a logically equivalent goal Wo 
whose only situation term is Sp. We have only indicated how regression 
works by reducing relational fluent atoms in W; there is an analogous way 
of reducing functional fluent terms. 

The full definition of the regression operator R is rather lengthy, mainly 
because functional fluents introduce certain complications, and we omit it 
here. See [24] for details. The principal result for regression is the following: 


Theorem 3 (Soundness and Completeness of Regression) 
Suppose W is a regressable sentence of Leitcate and D is a basic theory of 
actions. Then, 


1. R[W] is a sentence uniform in So. 


2 DEW iff Ds, UDuna E R[W]. 


This result shows how, using regression, one can reduce the problem 
of proving a regressable sentence in a basic action theory to an entailment 
problem relative to the initial database and the unique names axioms for 
actions. The foundational axioms © of the situation calculus are not re- 
quired for establishing this entailment. The action precondition and succes- 
sor state axioms are also not required; their effects have been compiled into 
the regressed formula. Regression is the central computational mechanism 
underlying implementations of the logic programming language GOLOG, 
and we now turn to this. 


6 GOLOG 


GOLOG is a situation calculus-based logic programming language for im- 
plementing complex behaviours for dynamical systems. The language was 
first introduced in [12, 15], and was described there without pinning down 
all its formal details. Here, we present the syntax and logical foundations 
for the language, most of which is taken from Pirri and Reiter [24]. 


6.1 The Syntax of GOLOG Programs 


For the purposes of specifying the syntax of the programming language 
GOLOG, we require a language that is just like £ejteate, except it suppresses 
all references to situations. 


Definition 6.1 Situation-Suppressed Terms and Formulas 
The situation-suppressed terms obtained from Lejteate are inductively de- 


fined by: 


1. Any variable of £yjteaic of sort action or object is a situation-suppressed 
term. 


2. If f is an (n+ 1)-ary functional fluent of Leiteaic, and t1,...,tn are 
situation-suppressed terms of sorts appropriate for the first n argu- 
ments of f, then f(t1,...,¢,) is a situation-suppressed term. 


3. If g is an m-ary non-functional fluent symbol of Lejteate other than Sp 
or do, and t,,...,tm are situation-suppressed terms of sorts appropri- 
ate for the arguments of g, then g(t1,...,tm) is a situation-suppressed 
term. 
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The situation-suppressed formulas obtained from L;teqie are Inductively de- 


fined by: 


1. 


Whenever ¢ and ?¢’ are situation-suppressed terms of the same sort, 
then t = ?’ is a situation-suppressed formula. Because situation- 
suppressed terms are never of sort situation, the situation-suppressed 
formulas never mention an equality atom between terms of sort situ- 
ation. 


When t is a situation-suppressed term of sort action, then Poss(t) is 
a situation-suppressed formula. 


When F is an (n + 1)-ary relational fluent symbol of Leiteaice and 


t1,...,t, are situation-suppressed terms of sorts appropriate for the 
first n arguments of F’, then F(ti,...,¢n) is a situation-suppressed 
formula. 


When P is an m-ary non-fluent predicate symbol of £,jteaic other than 


C, and t,,...,tm are situation-suppressed terms of sorts appropriate 
for the arguments of P, then P(t1,...,tm) is a situation-suppressed 
formula. 


When ¢ and w are situation-suppressed formulas, so are =¢ and dAv. 
When v is a variable of Leiteaic of sort action or object, then (dv)¢ is 
a situation-suppressed formula. 


Therefore, situation-suppressed formulas are first order, never quantify over 


situations, never mention C 


tion. 


nor do they ever mention terms of sort situa- 


—) 


We can now describe the syntax of GOLOG programs: 


(program) ::= (primitive action) | 


Here, 


(test condition)? | 

({(program); (program)) | 
((program) | (program)) | 

(rx) (program) | 

(procedure call) | 

(proc P; (@,) (program) endProc ; 


proc P,, (&) (program) endProc ; 
(program)) 


. (primitive action) is a situation-suppressed term of sort action. 


. {test condition) is a situation-suppressed formula. 


In (w2){program), « must be a variable of Leitcatc of sort action or 
object. 


(procedure call) must have the syntactic form P(t1,...,t,), where P 
is an (n+2)-ary predicate variable of Leiteaic whose first n arguments 
are not of sort situation, and whose last two arguments are of sort 
situation. Moreover, t1,...,t, must be situation-suppressed terms 
whose sorts are appropriate for the first n arguments of P. 


In a procedure declaration proc P (#@) (program) endProc, P must 
be an (n+2)-ary predicate variable of Lsitcate whose first n arguments 
are not of sort situation, and whose last two arguments are of sort 
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situation. Moreover, v must be variables of Leitegic whose sorts are 
appropriate for the first n arguments of P. 


Notice that GOLOG provides for arbitrary nesting of blocks with procedure 
declarations local to their block. Other control structures, e.g. conditionals 
and while loops, can be defined in terms of the above constructs: 


if (test condition) then (programl1) else (program?) af 


(test condition)? ; (program) | a(test condition)? ; (program?) 


To define while loops, first introduce a nondeterministic iteration operator 
*, where (program)* means do (program) 0 or more times: 


(program)* act proc P() true? | [(program); PQ] endProc; P() 


* 


Then while loops can be defined in terms of the * operator: 


while (test condition) do (program) endWhile af 


[(test condition)? ; (program)]* ; a(test condition)? 


Just as conventional Algol-like programming languages never explicitly 
refer to the state of their computation, GOLOG programs never explicitly 
mention situations, hence the emphasis above on situation-suppressed terms 
and formulas. In defining the semantics of such programs, these situations 
are taken into account, as we now describe. 


6.2 The Semantics of GOLOG 


Definition 6.2 Restoring Suppressed Situation Arguments 
Whenever ¢ is a situation-suppressed term and o is a term of Ljteaic of sort 
situation, t{o] denotes that term of Lsiteate obtained by restoring the term o 
as the situation argument to all of the functional fluents terms mentioned by 
t. In addition, whenever ¢ is a situation-suppressed formula, ¢[o] denotes 
that formula of Lesitcaie obtained by restoring the term o as the situation 
argument to all of the functional fluent terms and all of the relational fluent 
atoms mentioned by @. 


Next, we define an abbreviation Do(6, 0,0’), where 6 is a program ex- 
pression, and ¢ and o” are terms of sort situation. Do(6,o,0’) should be 
viewed as a macro that expands into a second order situation calculus for- 
mula; moreover, that formula says that situation o' can be reached from 
situation o by executing some sequence of actions specified by 6. Note that 
our programs may be nondeterministic, that is, may have multiple execu- 
tions terminating in different situations. 

Dois defined inductively on the structure of its first argument as follows: 


1. Primitive actions: When a is a situation-suppressed term of sort 
action, 


Do(a,o,o') aef Poss(alo],c) Ao’ = do(ale],c). 


2. Test actions: When ¢ is a situation-suppressed formula, 


Do(61; 62,0, 0°) af (As).Do(61,¢,8) A Do(éa, 8,0’). 
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4. Nondeterministic choice of two actions: 
Do(61 | 62,0, 0°) af Do(61,0,0') V Do(6s,¢, 0’). 


5. Nondeterministic choice of action arguments: 


Do((r2) 6,0, 0°) (J (Ar) Do(6, 0,0"). 


ane 


6. Procedure calls: For any predicate variable P of arity n+2 whose first 
n arguments are not of sort situation, and whose last two arguments 
are of sort situation: 


Do(P(ti,...,tn), 0,0) P(ti[o],...,trle],o, 0’). 


7. Blocks with local procedure declarations: 


Do({proc P; (#) 6; endProc ; --- ; 
proc P, (t,) 6, endProc ; 69},¢,0’) 


def (VP, . ., Pa) .[]\ (Ws, 8", 8) Do(d;, 5, 5’) >) P,(%;, 8, 8’)] 


t=1 
> Do(69, ¢, 0’). 


Notice that in the definition for procedure calls, the actual parameters (¢;) 
are first evaluated with respect to the current situation o (t;[¢]) before 
passing them to the procedure P, so GOLOG’s procedure invocation is call 
by value. 

Except for procedures, the above macro approach to defining the seman- 
tics of GOLOG draws considerably from dynamic logic [5]. In effect, it reifies 
as situations in the object language of the situation calculus, the possible 
worlds with which the semantics of dynamic logic is defined. The macro defi- 
nition for GOLOG procedures corresponds to the more usual Scott-Strachey 
least fixed-point definition in standard programming language semantics [32]. 

With the above definition of Do(6,o,o’) in hand, we are in a position 
to define what we mean by the evaluation of a GOLOG program. 


6.3. The Evaluation of GOLOG Programs 


Definition 6.3 Proper GOLOG Program 

A GOLOG program 6 is proper iff s and s’ are the only free variables (in- 
dividual or predicate) mentioned in Do(é, s,s’). In other words, for each 
individual variable z mentioned in 6, « lies in the scope of (Va) or (4a) in 
a test condition of 6, or x is mentioned in the scope of a nondeterminis- 
tic choice operator (7). Moreover, for every procedure call P(t1,...,tn) 
mentioned in 6, the second order variable P is bound in 6 by a procedure 
declaration for P in some block surrounding the procedure call. 


A proper GOLOG program is evaluated relative to a background basic 
theory of actions specifying a particular application domain. Specifically, if 
PD is such a basic action theory, and 6 is a proper GOLOG program, then 
the evaluation of 6 relative to D is defined to be the task of establishing the 
following entailment: 


D E& (As) Do(6, So, 8). 


Any binding for the existentially quantified variable s obtained as a side 
effect of such a proof constitutes an execution trace of 6. In this respect, 
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the evaluation of a GOLOG program is much like that of a Prolog goal 
statement, in the sense that both are evaluated by a theorem-prover for the 
purposes of obtaining bindings to existentially quantified variables of the 
theorem. Of course, in the case of GOLOG, this theorem-proving task is 
much more daunting, because in general, (4s) Do(6, So, s) stands for a very 
complicated second order sentence of Lej¢eate, and moreover, PD includes 
a second order induction axiom. However, GOLOG enjoys a strong »- 
Reduction property that somewhat simplifies this task, as we now describe. 


6.4 -Elimination for GOLOG 


As defined above, a GOLOG program is evaluated relative to a basic action 
theory D. One can prove [24] that for this purpose, the foundational axioms 
for situations contained in PD are not needed: 


Theorem 4 (-Elimination Theorem for GOLOG) 
Suppose that D = MUDs; UDap UPuna UDs, ts a basic action theory and 
that 6 1s a proper GOLOG program. Then, 


D FE (ds)Do(6, 50,8) if Dss UPap UPuna UPs, F (ds) Do(6, So, 8). 


7 Sensing and Knowledge 


To talk about knowledge explicitly in the language of the situation calculus, 
we follow Moore [21] and suppose that there are situations other than Sp and 
its successors that serve as epistemic alternatives to the “real” situations. 
A fluent is considered to be known in a situation s if it holds in all the 
epistemic alternatives to s. To allow knowledge to change as the result of 
performing actions, we imagine that certain actions return sensing results 
(here taken to be binary). What is known in a situation s will be fully 
determined by what was known initially, the actions that led to situation 
s, and the sensing results of those actions. The material in this section is 
adapted from Lakemeyer and Levesque [11]. 

The language of the epistemic situation calculus £. is Lsiteate augmented 
by two new distinguished symbols: Ko, a unary predicate over situations, 
and SF(a,s), a binary predicate over actions and situations. Informally, 
Ko(s) holds when s is an epistemic alternative to So, and SF(a,s) holds 
when action a returns binary sensing result | in situation s. 

The foundational axioms for £, describe the augmented set of situations. 
First, for any situation term ¢, let Init(a) be the abbreviation 


Init(c) = (Ws, a). o # do(a,s). 
where a and s do not appear ino. The first axiom is 
do(a1, $1) = do(ag, 82) D a, = ag A581 = $2, 
just as it was in %, and 
(VP) {(Vs)[Init(s) D P(s)] A (¥s,a).P(s) D P(do(a,s))} D (Vs).P(s), 


which is similar to its counterpart in %, stating that all situations are suc- 
cessors of some initial one. Next, we need axioms that define the C relation: 


Init(s') Das CT 8’, 
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which is an analogue of the one in ©, and 


sCdo(a,s')=sCs', 


which is unchanged. Next, we need to say that Sp and Ko are initial situa- 
tions: 


Init(So) A (Vs).Ko(s) D Init(s). 


Finally, we need to characterize the set of initial situations. To ensure that 
nothing need be known about the fluents initially, we insist that there be 
initial situations (to serve as possible epistemic alternatives) for all combi- 
nations of fluent values. To keep things simple here, let us assume that there 
are only finitely many relational fluents F,...,F, € £e, and no functional 
fluents. We then use the following abbreviation: 


FV(P1,-+-5 Pas Pais Pate,s) & 


Nini (VE) [Fi (Zi, 5) = Pi(Z)] 
A (Va)[Poss(a,s) = Pazi(a)] A (Va)[SF (a, 5) = Pryo(a)]. 


The next foundational axiom is then 


(VP)(ds). Init(s) A FV(P,s). 


This is a variant of the existence of situations axiom first introduced by 
Baker [1]. For some applications it will be desirable to have additional ini- 
tial situations where actions produce arbitrary effects [11]. Here, we simply 
assume that actions satisfy the successor state axioms in all epistemic alter- 
natives, and hence we state that there are no additional initial situations. 
So the final foundational axiom is 

(V1, 82). Init(s,) A Init(s2) A(s1 # 82) D 


> > 


(AP).FV(P, 8:) \N7nFV(P, 52). 


We refer to these seven foundational axioms as ™.. 
To characterize what is known, we need two new categories of axioms: 


Definition 7.1 Sensed Fluent Axiom 
A sensed fluent axiom of £, for an action A is a sentence of the form: 


SF(A(a1,...,8n),8) = Oa(m,...,€n, 8), 


where A is an n-ary action function symbol, and O4(%,...,%p,5) is a 
formula that is uniform in s and whose free variables are among 21,..., pn, 8. 


‘These axioms are used to characterize the conditions under which the action 
returns the binary value 1. For example, in a 1-dimensional robotics domain, 
we might have 


SF (check Bumper, s) = (da). robot Pos(x,s) Aa < 3.5 


as a way of saying that a bumper sensor on the robot returns 1 when the 
robot gets within 3.5 units of a wall located at the origin. Actions that do 
not involve any sensing can be assumed to always return 1, and hence the 


O would be TRUE. 
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Definition 7.2 Initial Knowledge Axiom 
An initial knowledge axiom of £, is a sentence of the form: 


Ko(s) = O(s), 


where O(s) is a formula that is uniform in s, and whose only free variable 
Is 8. 


Axioms of this type are used to characterize the epistemic alternatives to 
So. For example, in the 1-dimensional robotics world, we might have 


Ko(s) = (Aa). « < 10A (Vy). robot Pos(y,s) = « = y. 


as a way of saying that all the robot knows initially is that it is no further 
than 10 units away from the wall. 
Given these, a knowledge-action theory D, is of the following form: 


D. =e UDs, UD ap UPuna UD 5, UPs¢ UP, 


where, &. is the new set of foundational axioms, Ds;, Pap, Puna, and Ds, 
are as before, Ds7 is a set of sensed fluent axioms, one for each action 
function symbol, and Px, is an initial knowledge axiom. 

With such a theory in hand, we can now explain what it means to know 
a formula in an arbitrary situation by using two abbreviations. First, we 
have 


K(o1,02) “ (WP) [... > P(o1,02)] 


where the ellipsis stands for the universal closure of 


{Init(s2) A Ko(s1) ») P(s1, 82)} A 
{P(s1,82) A [Poss(a,s,) = Poss(a,s2)] A 
[SF (a, 51) = SF (a, s2)] D P(do(a, 51), do(a, s2))}. 


The formula A(s’, s) can be read as saying that s’ is an epistemic alter- 
native to situation s. It is not hard to show that as a binary relation, K is 
required to be transitive and Euclidean: 

Theorem 5 The closure of the following two formulas is logically valid: 

1. K (82, $1) A K(ss, $2) >) K(8s, $1) 

2. K (82, $1) A K(ss, $1) >) K(8s, $2) 

With respect to knowledge, this means that we are imagining an agent 
that has positive and negative introspection [8]. The above definition of 
XK also inductively specifies the epistemic alternatives after performing any 
sequence of actions: 

Theorem 6 The closure of the following two formulas is logically valid: 

1. K(s', So) = Ko(s’); 


2. K(s',do(a, s)) = (4s"). 8’ = do(a,s")A K(s", 8) A 
[Poss(a, s”) = Poss(a,s)] A[SF(a,s"”) = SF(a,s)]. 
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So for an initial situation like So, s’ is an epistemic alternative only when 
Ko(s') holds; for a non-initial situation do(a,s), an epistemic alternative is 
of the form do(a, s”) where s” is an alternative to s, and where s and s” 
agree on Poss and SF. 

If we were to consider the formula AK (—, s) as a fluent, we can see that 
the second part of the above theorem amounts to a successor state theorem 
for K, as it shows how to obtain the value of A(—, do(a, s)) as a function 
of K(—,s) and other properties of a and s. Because we define knowledge 
in terms of K below, this means we have provided a solution to the frame 
problem for knowledge: a complete specification of how knowledge changes 
as the result of performing any action. 

Finally, we have the abbreviation 


Knows(¢,¢) af (Vs). K(s,0) D 9[s] 


where ¢ is a situation-suppressed formula and o is any term of sort situation. 
This says that @ is known in situation o if it comes out true at all epistemic 
alternatives to o. Here for simplicity, we are requiring ¢ to be situation- 
suppressed; a more elaborate account would allow us to express knowledge 
about knowledge and other formulas where the ¢ mentions situations. 

To see how this would work in practice, imagine that we have an action 
theory D, for the 1-dimensional robotics world that includes the two axioms 
above and 


(Vz). robot Pos(z, So) = z= 4.0 


saying that the actual position of the robot is 4.0 units away from the wall, 
as well as a successor state axiom for robot Pos saying that it is only affected 
by a move action: 


(Vz). robot Pos(z, do(move(«), s)) = robot Pos(z + 2, 8). 


Then, letting WallClose(c) stand for the abbreviation 


(dx). a <3.5A (Vy). robot Pos(y,o) =a = y. 


we get the following as logical consequences of D_: 


aK nows(WallClose, So) 
aK nows(AWallClose, So) 


aK nows(WallClose, do(move(—1), So)) 
aK nows(AWallClose, do(move(—1), So)) 


Knows(WallClose, do(check Bumper, do(move(—1), Sp))) 
Knows(WallClose, do(move(—1), do(check Bumper, do(move(—1), So )))) 


aK nows(WallClose, do(move(1), do(check Bumper, do(move(—1), So )))) 
aK nows(AWallClose, do(move(1), do(check Bumper, do(move(—1), So)))). 
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